Start 



$ 
CO 

rii 
ru 
m 

M 

u 
U) 
W 

□ 
a 







Text input 
205 




r 


Text analysis 
210 




f 


Linguistic and prosodic 
analysis 
215 




r 


Output generation 
220 




r 



Figure 2 



End 



□ 

m 

fii 
ru 

If! 

4; 

SI 

□ 

Ul 

Ul 

□ 
a 



Text 
305 



V 



Text analysis 
module 
310 



V 



Linguistic and 
prosodic analysis 
315 



V 



Waveform 
synthesis 
320 



V 



Digital Audio 
Output 
330 



Text-to-speech system 
300 



Concatenative 
diphone database 
325 



Figure 3 



Total Size: 6.1MB 
435 



□ 

Mi 

ru 

m 
m 

Si 

o 
w 

w 
o 

Q 



Diphone 
waveforms 
(5.1MB) 
405 



LPC coefficients 
(800kB) 
410 



Pitch marks 
(300kB) 
415 




Total Size: 550kB 
440 



Compressed diphone 
residuals and LPC 
coefficients (250kB) 
420 



Pitch marks (300kB) 
430 



Figure 4 









N 


o 


*« in 


Wave 


synthe 
54 



7^ 7^ 



to 

3 
O) 



■o 

8 

■o 

CO 

CVJ O 




g> S 

> -Q 

co iS 

C 03 o 
Q> "D t- 

E iSc" 
o c ° 

o 8f 



w 
w 

Q_ "J 



CD 

8 

C O 

CO ^ 

r- 

O 



| £ g 

b ss 



1: 



0) £ 

Is 

CD iS 

5; 01 o 

0J T3 o 

o2 

O Q. 
T3 



c 



Start 



Receive diphone waveforms 
605 



4 

Compress the diphone waveforms into 
diphone residuals using an encoder 
610 



Generate LPC cefficients 
615 



Hi 

ru 
ru 

If! 

M 

O 
UJ 

lij 

Q 
O 



Store the encoder-generated 
compressed packets 
620 



Request the diphone residuals 
625 




r 


Locate the requested diphone 
residuals 
630 




r 



Extract the located diphone residuals 
635 



Decompress the diphone residuals 
using a decoder 
640 




r 


Supplying the residuals, LPC 
coefficients, and pitch marks to a 
waveform synthesizer 
645 




r 



Figure 6 



c 



End 



